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Nucleic Acids, Proteins, and Antibodies 

i ■ 
" ' A .- ' ' 

( 

[1] This application refers to a "Sequence Listing" that is provided only on electronic 
media in computer readable form pursuant to Administrative Instructions Section 801(a)(i). 
The Sequence Listing forms a part of this description pursuant to Rule 5.2 and 
Administrative Instructions Sections 801 to 806, and is hereby incorporated in its entirety. 
[2] The Sequence Listing is provided as an electronic file (PTZ13PCT_seqList.txt,. 

5,421,455 bytes in size, created on January 13, 2001) on four identical compact discs (CD- 
R), labeled "COPY 1," "COPY 2," "COPY 3," and "CRF." The Sequence Listing complies 
with Annex C of the Administrative Instructions, and may be viewed, for example, on an 
IBM-PC machine running the MS-Windows operating system by using the V viewer 
software, version 2000 (see World Wide Web URL: http://www.fileviewer.com). 

Field of the Invention 

[3] The present invention relates to novel proteins. More specifically, isolated 

nucleic acid molecules are provided. encoding novel polypeptides. Novel polypeptides and 
antibodies that bind to these polypeptides are provided. Also provided are vectors, host 
cells, and recombinant and synthetic methods for producing human, polynucleotides and/or 
polypeptides, and antibodies. The invention further relates to diagnostic and therapeutic 
methods useful for diagnosing, treating, preventing and/or prognosing disorders related to 
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these novel polypeptides. The invention further relates to screening methods for identifying 
agonists and antagonists of polynucleotides and polypeptides of the invention. The present 
invention further relates to methods and/or compositions for inhibiting or enhancing the 
production and function of the polypeptides of the present invention. 

Background of the Invention 

14] Enzymes comprise a large subset of proteins which function as catalysts for 
biochemical reactions. In fact, virtually every biochemical reaction involves the catalytic 
activity of an enzyme or enzymes. Most enzymes are located intracellularly, but there are a 
number of enzyme families which are either secreted into the extracellular space, or 
associated with the plasma membrane. Some enzymes, including the secreted digestive 
enzymes trypsin and pepsin, are produced as inactive precursors called zymogens, which 
require chemical modification to become active. In many cases, the catalytic activity of an 
enzyme depends on its association with a cofactor. Cofactors may, be organic molecules, 
termed coenzymes, or metal ions. Many coenzymes are derived from vitamins. 
15] Enzymes contain two important functional units: the substrate binding site, and 

the catalytic site. The substrate-binding site consists of a cleft which is geometrically 
complementary to the shape of the preferred substrate. In addition, the amino acid residues 
which form the substrate binding site have noncovalent interactions with the amino acids of 
the complementary substrate region. The catalytic site is the portion of the molecule that 
facilitates the biochemical reaction once the substrate is bound to the enzyme. For a more 
extensive discussion of enzyme properties, see Biochemistry. Voet and Voet (1990); and 
Molecular Cell Biology. 2 nd Edition, Darnell et al. (1990). 

[6] The International Union of Biochemistry and Molecular Biology (IUBMB) has 
established enzyme nomenclature guidelines to provide an organizational framework to the 
growing field of enzymology (see Enzyme Nomenclature, Academic Press (1992), or the 
IUBMB Nomenclature Committee web site at the URL address: http://www.chem.qmw. 
ac.uk/iubmb/enzyme ). According to the IUBMB guidelines, all enzymes can be 
categorized by the chemical reaction they catalyze. Documented enzymes are assigned an 
identifier which takes the form of "EC [AB.C.DY\ where A is one of the major functional 
classes of enzymes (1 through 6; see below); B and C designate increasingly specific 
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subgroups of enzymatic reactions; and D represents the arbitrary number of an individual 
member of a given category. As an example, the enzyme acetylcholinesterase is designated 
EC 3. LI. 7, because it is a member of the class of hydrolases (EC 3.-.-.-), acting on ester 
bonds (EC 3.1.-.-), in the subgroup of carboxylic ester hydrolases (EC 3.1.1.-), and it is 
assigned number 7. Descriptions of the six major functional classes, including notable 
examples of each, follow below. 

Oxidoreductases (EC 1 .-.-.-) 

[7] Enzymes of this class catalyze oxidation-reduction reactions. Sub-classification 
is according to the substrate group oxidized (e.g., CH-OH, CH-CH, and CH-NH2, to name 
but a few). A representative member of this enzyme class is long-chain-alcohol 
dehydrogenase (EC 1.1.1.192), which is involved in lipid metabolism. Deficient activity of 
this enzyme has been shown to be the primary cause of Sjogren-Larsson syndrome, an 
autosomal recessive disorder characterized by the presence of ichthyosis, mental 
retardation, and spasticity (Rizzo etal t J. Clin. Invest 81: 738-744 (1988). 

Transferases (EC 2.-.-.-) 

[8] Catalytic reactions of transferases are characterized by the transfer of a chemical 

group from a "donor" molecule to an "acceptor" molecule. Transferases can be subgrouped 
according to the chemical group transferred. For example, amino transferases (EC 2.6.-.-) 
transfer nitrogenous groups, and methyltranferases (EC 2.1.1.-) transfer methyl groups. 
Often the transferred group is donated by a coenzyme. A major subgroup of transferase 
enzymes are the protein kinases (EC 2.7.-.-), which catalyze the transfer of a phosphate 
group from ATP to a substrate protein. Protein kinases, such as calcium/calmodulin 
dependent (CaM) kinase H (EC 2.7.1.123), are known to play important roles in signal 
transduction pathways (Kennedy , Brain Res Brain Res Rev 26(2-3):243-57 (1998)). Other 
transferases are involved in metabolic processes. For example, guanidinoacetate N- 
methyltransferase (GAMT; EC 2.1.1.2), converts guanidinoacetate into creatine, which is 
essential for the maintenance of energy reserves .in the form of ATP. GAMT deficiency 
causes neurological impairments which may include progressive extrapyramidal movement 
disorders, seizures, developmental delay, and muscular dystonia (Stockier et al., Pediat 
Res. 36:409-413 (1994). 
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Hydrolases (EC 3.-.-.-) 

[9] Enzymes of this class catalyze the splitting of a substrate into two fragments by 

the addition of a water molecule; the water's hydroxyl group being incorporated in one 
fragment and the hydrogen atom in the other. Hydrolases can be subcategorized according 
to the chemical bond involved. For example, peptidases (EC 3.4.-.-; also known as 
proteases) are hydrolases which catalyze the breaking of peptide bonds. Pepsin (EC 
3.4.23.1), a digestive protease which has been implicated in a number of gastrointestinal 
disorders, is an example of a proteolytic hydrolase enzyme (see, for example, Hirschowitz, 
Yale J. Biol Med 72(2-3): 133-43 (1999), and Del Bianco et al., Dig. Liver Dis. 32(l):12-9 
(2000)). Deficient activity of beta-glucocerebrosidase (EC 3.2.1.45), an O-glycosyl 
hydrolase, is associated with Gaucher* s disease. Symptoms of Gaucher' s disease include 
bone lesions, skin pigmentation, enlargement of the liver and spleen, and, in some cases, 
neurological impairments. 

Lyases (EC 4.-.-.-) 

[10J Lyases cleave C-C, C-O, C-N, and other bonds by means other than hydrolysis or 
oxidation. The reverse reaction is performed by a synthetase. Histidine decarboxylase (EC 
4.1.1.22) is a carboxy-lyase that converts histidine to histamine, a biogenic amine involved 
in a number of physiologic processes, including inflammation, allergic responses, 
neurotransmission, and gastric acid secretion. The phosphorus-oxygen lyase, adenylate 
cyclase (EC 4.6.1.1), is an intracellular enzyme which acts on ATP to form adenosine 3* ,5* 
-cyclic phosphate (c AMP), a second messenger activator of protein kinase activity. 

Isomerases (EC 5.-.-.-Y 

[11] Members of this class of enzymes catalyze geometric or structural changes within 
a molecule to form an isomer. Subclasses of isomerases include racemases / epimerases 
(EC 5.1.-.-.), cis-trans- isomerases (EC 5.2.-.-), intramolecular isomerases (EC 5.3.-.-), 
intramolecular transferases (EC 5.4.-.-), and intramolecular lyases (EC 5.5.-.-). Protein 
disulfide isomerase (PDI; EC 5.3.4.1) catalyzes the intramolecular rearrangement of 
disulfide bonds, thus contributing to the folding of newly-synthesized proteins at the 
endoplasmic reticulum (see, for example, Luz and Lennarz, EXS 77:97-117 (1996)). 
Autoantibodies to PDI have been implicated in hepatic disorders (Nagayama et al., J 
Toxicol Sci Aug;19(3):163-9 (1994)). 
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Ligases (EC 6.-.-.-) 

[12] Ligase enzymes catalyze the formation of a bond between two substrate 
molecules, coupled with the hydrolysis of a pyrophosphate bond in ATP or a similar 
triphosphate. A well characterized example of this class is DNA ligase 1 (EC 6.5.1.1), 
which catalyzes the joining of DNA fragments (via the formation of a phosphodiester 
bond) during DNA replication, recombination, and repair. Mutations in the gene encoding 
DNA ligase 1 have been linked to immunodeficiency disorders and hypersensitivity to 
DNA-damaging agents (Barnes et aL, Cell, 69, 495-503 (1992)). 

[13] The discovery of new human enzyme polynucleotides, the polypeptides encoded 
by them, and antibodies that immunospecifically bind these polypeptides, satisfies a need in 
the art by providing new compositions which are useful in the diagnosis, treatment, 
prevention and/or prognosis of a range of conditions, including but not limited to cancer, 
immunodeficiencies, neurological disorders, and metabolic disorders. 



Summary of the Invention 
[14] The present invention relates to novel proteins. More specifically, isolated 
nucleic acid molecules are provided encoding novel polypeptides. Novel polypeptides and 
antibodies that bind to these polypeptides are provided. Also provided are vectors, host 
cells, and recombinant and synthetic methods for producing human polynucleotides and/or 
polypeptides, and antibodies. The invention further relates to diagnostic and therapeutic 
methods useful for diagnosing, treating, preventing and/or prognosing disorders related to 
these novel polypeptides. The invention further relates to screening methods for identifying 
agonists and antagonists of polynucleotides and polypeptides of the invention. The present 
invention further relates to methods and/or compositions for inhibiting or enhancing the 
production and function of the polypeptides of the present invention. 
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Detailed Description 

Tables 

[151 Table 1 A summarizes some of the polynucleotides encompassed by the invention 
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences 
(contig identifier (Contig ID:) and contig nucleotide sequence identifier (SEQ ID NO:X)) 
and further summarizes certain characteristics of these polynucleotides and the polypeptides 
encoded thereby. The first column provides the gene number in the application for each 
clone identifier. The second column provides a unique clone identifier, "Clone ID NO:Z", 
for a cDNA clone related to each contig sequence disclosed in Table 1A. The third column 
provides a unique contig identifier, "Contig ID:" for each of the contig sequences disclosed 
in Table 1A. The fourth column provides the sequence identifier, "SEQ ID NO:X", for 
each of the contig sequences disclosed in Table 1 A. The fifth column, "ORF (From-To)", 
provides the location (i.e., nucleotide position numbers) within the polynucleotide 
sequence of SEQ ID NO:X that delineate the preferred open reading frame (ORF) that 
encodes the amino acid sequence shown in the sequence listing and referenced in Table 1 A 
as SEQ ID NO:Y (column 6). Column 7 lists residues comprising predicted epitopes 
contained in the polypeptides encoded by each of the preferred ORFs (SEQ ID NO:Y). 
Identification of potential immunogenic regions was performed according to the method of 
Jameson and Wolf (CABIOS, 4; 181-186 (1988)); specifically, the Genetics Computer 
Group (GCG) implementation of this algorithm, embodied in the program 
PEPTIDESTRUCTURE (Wisconsin Package vlO.0, Genetics Computer Group (GCG), 
Madison, Wise ). This method returns a measure of the probability that a given residue is 
found on the surface of the protein. Regions where the antigenic index score is greater than 
0.9 over at least 6 amino acids are indicated in Table 1A as "Predicted Epitopes*', hi 
particular embodiments, polypeptides of the invention comprise, or alternatively consist of, 
one, two, three, four, five or more of the predicted epitopes described in Table 1A. It will 
be appreciated that depending on the analytical criteria used to predict antigenic 
determinants, the exact address of the determinant may vary slightly. Column 8, "Tissue 
Distribution" shows the expression profile of tissue, cells, and/or cell line libraries which 
express the polynucleotides of the invention. The first number in column 8 (preceding the 
colon), represents the tissue/cell source identifier code corresponding to the key provided in 



6 



WO 01/55301 



PCT/US01/01239 



Table 4. Expression of these polynucleotides was not observed in the other tissues and/or 
cell libraries tested. For those identifier codes in which the first two letters are not "AR", 
the second number in column 8 (following the colon), represents the number of times a 
sequence corresponding to the reference polynucleotide sequence (e.g., SEQ ID NO:X) was 
identified in the tissue/cell source. Those tissue/cell source identifier codes in which the 
first two letters are "AR" designate information generated using DNA array technology. 
Utilizing this technology, cDNAs were amplified by PGR and then transferred, in duplicate, 
onto the array. Gene expression was assayed through hybridization of first strand cDNA 
probes to the DNA array. cDNA probes were generated from total RNA extracted from a 
variety of different tissues and cell lines. Probe synthesis was performed in the presence of 
33 P dCTP, using oligo(dT) to prime reverse transcription. After hybridization, high 
stringency washing conditions were employed to remove non-specific hybrids from the 
array. The remaining signal, emanating from each gene target, was measured using a 
Phosphorimager. Gene expression was reported as Phosphor Stimulating Luminescence 
(PSL) which reflects the level of phosphor signal generated from the probe hybridized to 
each of the gene targets represented on the array. A local background signal subtraction was 
performed before the total signal generated from each array was used to normalize gene 
expression between the different hybridizations. The value presented after "[array code]:" 
represents the mean of the duplicate values, following background subtraction and probe 
normalization. One of skill in the art could routinely use this information to identify normal 
and/or diseased tissue(s) which show a predominant expression pattern of the corresponding 
polynucleotide of the invention or to identify polynucleotides which show predominant 
and/or specific tissue and/or cell expression. Column 9 provides the chromosomal location 
of polynucleotides corresponding to SEQ ID NO:X. Ghromosomal location was determined 
by finding exact matches to EST and cDNA sequences contained in the NCBI (National 
Center for Biotechnology Information) UniGene database. Given a presumptive 
chromosomal location, disease locus association was determined by comparison with the 
Morbid Map, derived from Online Mendelian Inheritance in Man (Online Mendelian 
Inheritance in Man, OMIM™ McKusick-Nathans Institute for Genetic Medicine, Johns 
Hopkins University (Baltimore, MD) and National Center for Biotechnology Information, 
National Library of Medicine (Bethesda, MD) 2000. World Wide Web URL: 
http://www.ncbi.nlm.nih.gov/omim/). If the putative chromosomal location of the Query 
overlaps with the chromosomal location of a Morbid Map entry, an OMIM identification 

7 



WO 01/55301 



PCT/US01/01239 



number is disclosed in column 10 labeled "OMM Disease Reference(s)". A key to the 
OMIM reference identification numbers is provided in Table 5. 

[16] Table IB summarizes additional polynucleotides encompassed by the invention 
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences 
(contig identifier (Contig ID:) contig nucleotide sequence identifiers (SEQ ID NO:X)), and 
genomic sequences (SEQ ID NO:B). The first column provides a unique clone identifier, 
"Clone ID NO:Z", for a cDNA clone related to each contig sequence. The second column 
provides the sequence identifier, "SEQ ID NO:X", for each contig sequence. The third 
column provides a unique contig identifier, "Contig ID:" for each contig sequence. The 
fourth column, provides a BAC identifier "BAC ID NO: A" for the BAG clone referenced in 
the corresponding row of the table. The fifth column provides the nucleotide sequence 
identifier, "SEQ ID NO:B" for a fragment of the BAC clone identified in column four of 
the corresponding row of the table. The sixth column, "Exon From-To", provides the 
location (i.e., nucleotide position numbers) within the polynucleotide sequence of SEQ ID 
NO:B which delineate certain polynucleotides of the invention that are also exemplary 
members of polynucleotide sequences that encode polypeptides of the invention (e.g., 
polypeptides containing amino acid sequences encoded by the polynucleotide sequences 
delineated in column six, and fragments and variants thereof). 

[17] Table 2 summarizes homology and features of some of the polypeptides of the 
invention. The first column provides a unique clone identifier, "Clone ID NO:Z", 
corresponding to a cDNA clone disclosed in Table 1A. The second column provides the 
unique contig identifier, "Contig ID:" corresponding to contigs in Table 1A and allowing 
for correlation with the information in Table 1 A. The third column provides the sequence 
identifier, "SEQ ID NO.X", for the contig polynucleotide sequence. The fourth column 
provides the analysis method by which the homology/identity disclosed in the Table was 
determined. Comparisons were made between polypeptides encoded by the polynucleotides 
of the invention and either a non-redundant protein database (herein referred to as "NR"), or 
a database of protein families (herein referred to as 'TFAM") as further described below. 
The fifth column provides a description of the PFAM/NR hit having a significant match to a 
polypeptide of the invention. Column six provides the accession number of the PFAM/NR 
hit disclosed in the fifth column. Column seven, "Score/Percent Identity", provides a quality 
score or the percent identity, of the hit disclosed in columns five and six. Columns 8 and 9, 
"NT From" and "NT To" respectively, delineate the polynucleotides in "SEQ ED NO:X" 
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that encode a polypeptide having a significant match to the PFAM/NR database as 
disclosed in the fifth and sixth columns. In specific embodiments polypeptides of the 
invention comprise, or alternatively consist of, an amino acid sequence encoded by a 
polynucleotide in SEQ ED NO:X as delineated in columns 8 and 9, or fragments or variants 
thereof. 

[18] The PFAM identification disclosed in Table 2, columns 5 and 6, communicates 
"both the function and enzymatic activity of polypeptides corresponding to the PFAM. 
Extensive documentation on PFAM families and individual members of these families are 
maintained in publicly accessible databases (see, for example the Sanger Centre PFAM web 
server at http://www.sanger.ac.uk/ for a searchable PFAM database). Using this 
information, and included links to PROSITE, SWISSPROT, GenBank, and other sequence 
databases, one can routinely assign an EC (Enzyme Commission) code to the polypeptides. 
The EC code consists of 4 integers separated by decimal points that are used to classify 
enzymes, and indicate important information about cellular function and enzyme 
mechanism. The first digit indicates a broad group of enzyme mechanism 
(i.e.l=oxidoreductases, 2=transferases). The second digit indicates the type of substrate the 
enzyme acts upon or a broad subcategory of the enzyme type (Le. EC 1.6 oxidoreductases 
acting on NADH or NADPH, or 5.1=racemases and epimerases, a subtype of EC 
5=isomerases). The third digit is used to distinguish further characteristics (EC 1.1.1 
oxidoreductases acting on the CH-OH group of donors with NAD or NADP as the acceptor, 
versus EC 1.1.2 where a cytochrome acts as the acceptor) or is simply assigned as 1 for the 
all entries where further clarification is unnecessary (all members of EC 4.1, carbon-carbon 
lyases are in group 4.1.1). The final number designates a specific enzyme, for instance, EC 
4.1.1.1 pyruvate decarboxylase, or EC 1.1.1.1 alcohol dehydrogenase. Thus, if all of the 
source sequences for the PFAM have EC codes of the form 1.1.3.X, where X is a positive 
integer, the polypeptide being evaluated is likely to have a similar EC code, and, in this 
example, will likely be an oxidoreductase acting on the CH-OH group of donors with 
oxygen as an acceptor. 

[19] Furthermore, knowledge of PFAM identification and/or EC code for a 
polypeptide communicates enzymatic activity of the protein. This activity can routinely be 
confirmed using or modifying assays known in the art. Additionally, these assays may 
routinely be applied or modified to evaluate the enzymatic activity of fragments and 
.variants of the invention. Further, these assays may routinely be applied or modified to 
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evaluate the ability of agonists or antagonists of the invention (e.g., agonistic or antagonistic 
antibodies) to enhance or reduce this enzymatic activity, respectively. 
[20] Table 3 provides polynucleotide sequences that may be disclaimed according to 
certain embodiments of the invention. The first column provides a unique clone identifier, 
"Clone ID", for a cDNA clone related to contig sequences disclosed in Table 1A. The 
. second column provides the sequence identifier, "SEQ ID NO:X", for contig sequences 
disclosed in Table 1A. The third column provides the unique contig identifier, "Contig 
H):", for contigs disclosed in Table 1A. The fourth column provides a unique integer V 
where 'a' is any integer between 1 and the final nucleotide minus 15 of SEQ ID NO:X, and 
the fifth column provides a unique integer 'b' where 'b' is any integer between 15 and the 
final nucleotide of SEQ ID NO:X, where both a and b correspond to the positions of 
nucleotide residues shown in SEQ ID NO:X, and where b is greater than or equal to a + 14. 
For each of the polynucleotides shown as SEQ ID NO:X, the uniquely defined integers can 
be substituted into the general formula of a-b, ;and used to describe polynucleotides which 
may be preferably excluded from the invention. In certain embodiments, preferably 
excluded from the invention are at least one, two, three, four, five, ten, or more of the 
polynucleotide sequence(s) having the accession numbers) disclosed in the sixth column of 
this Table (including for example, published sequence in connection with a particular BAC 
clone). In further embodiments, preferably excluded from the invention are the specific 
polynucleotide sequence(s) contained in the clones corresponding to at least one, two, three, 
four, five, ten, or more of the available material having the accession numbers identified in 
the sixth column of this Table (including for example, the actual sequence contained in an 
identified BAC clone). 

[21] Table 4 provides a key to the tissue/cell source identifier code disclosed in Table 
1A, column 8. Column 1 provides the tissue/cell source identifier code disclosed in Table 
1A, Column 8. Columns 2-5 provide a description of the tissue or cell source. Codes 
corresponding to diseased tissues are indicated in column 6 with the word "disease". The 
use of the word "disease" in column 6 is non-limiting. The tissue or cell source may be 
specific (e.g. a neoplasm), or may be disease-associated (e.g., a tissue sample from a 
normal portion of a diseased organ). Furthermore, tissues and/or cells lacking the "disease" 
designation may still be derived from sources directly or indirectly involved in a disease 
state or disorder, and therefore may have a further utility in that disease state or disorder. In 
numerous cases where the tissue/cell source is a library, column 7 identifies the vector used 
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to generate the library. 

[22] Table 5 provides a key to the OMIM reference identification numbers disclosed 
in Table 1A, column 10. OMIM reference identification numbers (Column 1) were derived 
from Online Mendelian Inheritance in Man (Online Mendelian Inheritance in Man, OMIM. 
McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins University (Baltimore, 
MD) and National Center for Biotechnology Information, National Library of Medicine, 
(Bethesda, MD) 2000. World Wide Web URL: ht^://www.ncbi.nlm.nih.gov/omim/). 
Column 2 provides diseases associated with the cytologic band disclosed in Table 1A, 
column 9, as determined using (he Morbid Map database. 

[23] Table 6 summarizes ATCC Deposits, Deposit dates, and ATCC designation 
numbers of deposits made with the ATCC in connection with the present application. 
[24] Table 7 shows the cDNA libraries sequenced, and ATCC designation numbers 
and vector information relating to these cDNA libraries. 

[25]^ Table 8 provides a physical characterization of clones encompassed by the 
invention. The first column provides the unique clone identifier, "Clone ID NO:Z", for 
certain cDNA clones of the invention, as described in Table 1A. The second column 
provides the size of the cDNA insert contained in the corresponding cDNA clone. 

Definitions 

[26] The following definitions are provided to facilitate understanding of certain terms 
used throughout this specification. 

[27] In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is altered 
<c by the hand of man" from its natural state. For example, an isolated polynucleotide could 
be part of a vector or a composition of matter, or could be contained within a cell, and still 
be "isolated" because that vector, composition of matter, or particular cell is not the original 
environment of the polynucleotide. The term "isolated" does not refer to genomic or cDNA 
libraries, whole cell total or mRNA preparations, genomic DNA preparations (including 
those separated by electrophoresis and transferred onto blots), sheared whole cell genomic 
DNA preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide/sequences of the present invention. 
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[28] As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence encoding SEQ ID NO:Y or a fragment or variant thereof; a nucleic acid sequence 
contained in SEQ ID NO:X (as described in column 3 of Table 1A) or the complement 
thereof; a cDNA sequence contained in Clone ID NO:Z (as described in column 2 of Table 
1A and contained within a library deposited with the ATCC); a nucleotide sequence 
encoding the polypeptide encoded by a nucleotide sequence in SEQ ID NO:B as defined in 
column 6 of Table IB or a fragment or variant thereof; or a nucleotide coding sequence in 
SEQ ID NO:B as defined in column 6 of Table IB or the complement thereof For 
example, the polynucleotide can contain the nucleotide sequence of the full length cDNA 
sequence, including the 5* and 3 1 untranslated sequences, the coding region, as well as 
fragments, epitopes, domains, and variants of the nucleic acid sequence. Moreover, as used 
herein, a "polypeptide" refers to a molecule having an amino acid sequence encoded by a 
polynucleotide of the invention as broadly defined (obviously excluding poly-Phenylalanine 
or poly-Lysine peptide sequences which result from translation of a polyA tail of a 
sequence corresponding to a cDNA). 

[29] In the present invention, "SEQ ID NO:X" was often generated by overlapping 
sequences contained in multiple clones (contig analysis). A representative clone containing 
all or most of the sequence for SEQ ID NO:X is deposited at Human Genome Sciences, Inc. 
(HGS) in a catalogued and archived library. As shown, for example, in column 2 of Table 
1 A, each clone is identified by a cDNA Clone ID (identifier generally referred to herein as 
Clone ID NO:Z). Each Clone ID is unique to an individual clone and the Clone ID is all the 
information needed to retrieve a given clone from the HGS library. Furthermore, certain 
clones disclosed in this application have been deposited with the ATCC on October 5, 2000, 
having the ATCC designation numbers PTA 2574 and PTA 2575; and on January 5, 2001, 
having the depositor reference numbers TS-1, TS-2, AC-1, and AC-2. In addition to the 
individual cDNA clone deposits, most of the cDNA libraries from which the clones were 
derived were deposited at the American Type Culture Collection (hereinafter "ATCC"). 
Table 7 provides a list of the deposited cDNA libraries. One can use the Clone ID NO:Z to 
determine the library source by reference to Tables 6 and 7. Table 7 lists the deposited 
cDNA libraries by name and links each library to an ATCC Deposit. Library names contain 
four characters, for example, 4< HTWE " The name of a cDNA clone (Clone ID) isolated 
from that library begins with the same four characters, for example "HTWEP07". As 
mentioned below, Table 1A correlates the Clone ID names with SEQ ID NO:X. Thus, 
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starting with an SEQ ID NO:X, one can use Tables 1, 6 and 7 to determine the 
corresponding Clone ID, which library it came from and which ATCC deposit the library is 
contained in. Furthermore, it is possible to retrieve a given cDNA clone from the source 
library by techniques known in the art and described elsewhere herein. The ATCC is 
located at 10801 University Boulevard, Manassas, Virginia 20110-2209, USA. The ATCC 
deposits were made pursuant to the terms of the Budapest Treaty on the international 
recognition of the deposit of microorganisms for the purposes of patent procedure. 
[30] In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 continuous 
nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 7.5kb, 
5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, polynucleotides of the 
invention comprise a portion of the coding sequences, as disclosed herein, but do not 
comprise all or a portion of any intron. In another embodiment, the polynucleotides 
comprising coding sequences do not contain coding sequences of a genomic flanking gene 
(i.e., 5' or 3' to the gene of interest in the genome). In other embodiments, the 
polynucleotides of the invention do not contain the coding sequence of more than 1000, 
500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 1 genomic flanking gene(s). 
[31] A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, undo: stringent hybridization conditions, to sequences contained in 
SEQ ID NO:X, or the complement thereof (e.g., the complement of any one, two, three, 
four, or more of the polynucleotide fragments described herein), the polynucleotide 
sequence delineated in columns 8 and 9 of Table 2 or the complement thereof, and/or 
cDNA sequences contained in Clone ID NO:Z (e.g., the complement of any one, two, three, 
four, or more of the polynucleotide fragments, or the cDNA clone within the pool of cDNA 
clones deposited with the ATCC, described herein), and/or the polynucleotide sequence 
delineated in column 6 of Table IB or the complement thereof. "Stringent hybridization 
conditions" refers to an overnight incubation at 42 degree C in a solution comprising 50% 
formamide, 5x SSC (750 mM NaCl, 75 mM trisodium citrate), 50 mM sodium phosphate 
(pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 fig/ml denatured, sheared 
salmon sperm DNA, followed by washing the filters in O.lx SSC at about 65 degree C 
[32] Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily accomplished 
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through the manipulation of formamide concentration (lower percentages of formamide 
result in lowered stringency); salt conditions, or temperature. For example, lower 
stringency conditions include an overnight incubation at 37 degree C in a solution 
comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH 2 P0 4 ; 0.02M EDTA, pH 7.4), 
0.5% SDS, 30% formamide, 100 ug/ml salmon sperm blocking DNA; followed by washes 
at 50 degree C with 1XSSPE, 0.1% SDS. In addition, to achieve even lower stringency, 
washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g. 5X SSC). 

[33] Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress background in 
hybridization experiments. Typical blocking reagents include Denhardt's reagent, 
BLOTTO, heparin, denatured salmon sperm DNA, and commercially available proprietary 
formulations. The inclusion of specific blocking reagents may require modification of the 
hybridization conditions described above, due to problems with compatibility. 
[34] Of course, a polynucleotide which hybridizes only to polyAf sequences (such as 
any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid molecule 
containing a poly (A) stretch or the complement thereof (e.g., practically any double- 
stranded cDNA clone generated using oligo dT as a primer). 

[35] The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA or 
modified RNA or DNA. For example, polynucleotides can be composed of single- and 
double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, 
single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded 
regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more 
typically, double-stranded or a mixture of single- and double-stranded regions. In addition, 
the polynucleotide can be composed of triple-stranded regions comprising RNA or DNA or 
both RNA and DNA. A polynucleotide may also contain one or more modified bases or 
DNA or RNA backbones modified for stability or for other reasons. "Modified" bases 
include, for example, tritylated bases and unusual bases such as inosine. A variety of 
modifications can be made to DNA and RNA; thus, "polynucleotide" embraces chemically, 
enzymatically, or metabolically modified forms. 
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[36] The polypeptide of the present invention can be composed of amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides may be 
modified by either natural processes, such as posttranslational processing, or by chemical 
modification techniques which are well known in the art. Such modifications are well 
described in basic texts and in more detailed monographs, as well as in a voluminous 
research literature. Modifications can occur anywhere in a polypeptide, including the 
peptide backbone, the amino acid side-chains and the amino or carboxyl termini. It will be 
appreciated that the same type of modification may be present in the same or varying 
degrees at several sites in a given polypeptide. Also, a given polypeptide may contain 
many types of modifications. Polypeptides may be branched, for example, as a result of 
ubiquitination, and they may be cyclic, with or without branching. Cyclic, branched, and 
branched cyclic polypeptides may result from posttranslation natural processes or may be 
made by synthetic methods. Modifications include acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of 
a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, 
formation of cysteine, formation of pyroglutamate, formylation, gamma-carboxylation, 
glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, pegylation, proteolytic processing, phosphorylation, prenylation, 
racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. 
Freeman and Company, New York (1993); POSTTRANSLATIONAL COVALENT 
MODIFICATION OF PROTEINS, B. C Johnson, Ed., Academic Press, New York, pgs. 
1-12 (1983); Seifter et afc Meth. Enzymol. 182:626-646 (1990); Rattan et al., Ann. N.Y. 
Acad. Sci. 663:48-62 (1992)). 

[37] ,! SEQ ID NO:X" refers to a polynucleotide sequence described, for example, in 
Tables 1 Aor 2, while "SEQ ID NO: Y" refers to a polypeptide sequence described in column 
6 of Table 1 A. SEQ ID NO:X is identified by an integer specified in column 4 of Table 1 A. 
The polypeptide sequence SEQ ID NO:Y is a translated open reading frame (ORF) encoded 
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by polynucleotide SEQ ID NO:X. "Clone ID NO:Z" refers to a cDNA clone described in 
column 2 of Table 1A. 

[38] "A polypeptide having functional activity" refers to a polypeptide capable of 
displaying one or more known functional activities associated with a full-length (complete) 
protein. Such functional activities include, but are not limited to, biological activity, 
antigenicity [ability to bind (or compete with a polypeptide for binding) to an anti- 
polypeptide antibody], immunogenicity (ability to generate antibody which binds to a 
specific polypeptide of the invention), ability to form multimers with polypeptides of the 
invention, and ability to bind to a receptor or ligand for a polypeptide. 
[39] The PFAM accession number disclosed in Table 2, column 6 provides a link, 
through publicly accessible databases (see, for example the Sanger Centre PFAM web 
server at http://www.sanger.ac.uk/, and included links to PROSITE, SWISSPROT, 
GenBank, and other sequence databases), to the associated EC code, or closely-related EC 
codes. As described above, EC codes provide a description of the biochemical reaction(s) 
catalyzed by an enzyme family. Based on the associated EC code(s), one can routinely test 
the polypeptides of the invention for functional activity (e.g. biological activity) using or 
routinely modifying assays known in the art and/or assays described herein. For example, 
one of skill in the art may routinely assay enzyme polypeptides (including fragments and 
variants) of the invention for activity using assays as described in Examples 38, 39, 46, 47, 
55, 60, 61, 62, and 65. Many other enzyme assays are known in the art, and may be useful 
for demonstrating activities of the polypeptides of the present invention. 
[40] "A polypeptide having biological activity" refers to a polypeptide exhibiting 
activity similar to, but not necessarily identical to, an activity of a polypeptide of the present 
invention, including mature forms, as measured in a particular biological assay, with or 
without dose dependency. In the case where dose dependency does exist, it need not be 
identical to that of the polypeptide, but rather substantially similar to the dose-dependence 
in a given activity as compared to the polypeptide of the present invention (i.e., the 
candidate polypeptide will exhibit greater activity or not more than about 25-fold less and, 
preferably, not more than about tenfold less activity, and most preferably, not more than 
about three-fold less activity relative to the polypeptide of the present invention). 
[41] Table 1 A summarizes some of the polynucleotides encompassed by the invention 
(including contig sequences (SEQ ID NO:X) and clones (Clone ID NO:Z) and further 
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summarizes certain characteristics of these polynucleotides and the polypeptides encoded 
thereby. 
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Pro-1 to Leu-6, 
His-25 to Arg-30, 
Lys-40 to Lys-46, 
Thr-95 to Asp-lOO, 
Gly-125 to Gly-130, 
Arg-139to Pro-144, 
Lys-179 to Met-187. 


His-14 to Pro-19. 


Ala-20 to Arg-25. 
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Leu-48 to Gln-54. 


Pro-45 to Ser-53, 
Ala-55 to Ala-63, 
Asp-130 to Leu-136. 


Pro-1 to Glu-10, 
His-60 to Arg-76, 
Pro-79 to Arg-85, 
Ala-95 to Ile-101, 
Glu-124toGlu-130, 
Lys-151 to Arg-158. 


Arg-1 toGly-7, 
Pro-19 to Cys-27, 
Leu-61 to Ala-72, 
Ser-90 to Ser-96, 
Thr-126 to Ser-143, 
Glu-167 to Gln-176, 
Ile-185 to Ser-193, 
Phe-249toPhe-256, 
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GIb-43 to Thr-58, 
Asn-74toHis-79, 
Gly-109.toTrp-114. 


Asp-1 to Ala-6, 
Pro-25 to Pro-30. 
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Arg-14 to Thr-21, 
Tyr-42 to Asp-47. 


Gly-1 to Gln-11, 
Ser-24 to Cys-33, 
Thr-37 to Gly-46, 
Thr-51 to Thr-63. 


Pro-10 to Pro-17, 
Cys-41 to Pro-50, 
Asn-64 to Arg-73, 
Ser-81toArg-87, 
Glu-93 to Pro-100. 


Leu-29 to Gly-40, 
Tyr-93 to .Ile-100. 


Tyr-28 to Val-37, 
Gln-39 to Met-44, 
Leu-52 to Asp-60. 






Pro-48 to Gly-53, 
Pro-88 to Ser-94, 
Gly-103 to Ser-108, 
Pro-141 to Gln-150. 


Pro-48 to Gly-53, 
Pro-88 to Ser-94, 
Gly-103 to Gly-1 11. 


Ser-39 to Thr-45, 
Thr-65toThr-71. 
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Pro-l to Ala-12. 
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Asp-49 to Gly-55, 
Pro-73 to Thr-80, 
Thr-98 to Phe-103. 


Asp-41 to Gly-47, 
Pro-65 to Thr-72, 
Thr-90 to Phe-95. 


Pro-51 to Phe-58. 
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Asp-1 to Gln-9. 


Cys-33 to Tyr-39. 




1629 


1066 




00 
00 

1 

CO 


y—t 
t> 
CO 
1 

wo 




o\ 


VO 
vo 

t-H 




905092 


669383 




HFTCG46 




VO 

wo 



101 



WO 01/55301 



PO7US01/01239 



000*00000 

OOOOt-Hi— 4 I*— I 

co co co co co co co co 



0000«— «OWO»Oi-HOV>OOOWOOOOt^»ncNCN 
t^t^O^O^^^OOV^V0 , 0 v 0r-iOOO0\Oi— fiO 
hVOh^Oh't^OOhOOOOOOfnOOOOMOOrH^VO 
^h0^ioVO(SCSO\^0\0!M(SHOG\OOHHH 

oor-ico^-^t»n*nuor-r-oooooooNcocovooooo 



r-H 




CO 




cm 




0 






T-H 




0 
t> 


L07 


0 


and 



CO 

o 

CN 
*— 1 10 
VO O 

1? 

II 



So 
00 T^- 



33 



~co~3 ^ 

50 ^ ^ 

H ^ 2 o 
«-» +5 o\ ^ 
o 00 CO 

t> 0\ V t-« 

H co < co 



vo 
o 



CN 
ON 



o 
vo 



to 
ON 
CN 
ON 



uo 



102 



WO 01/55301 



PCT/US01/01239 



602491 




i 


• 












1 




T-H 

t-h ( ;M-(Shhh h h h h 


AR089: 15,AR061: 6 
S0260: 1 andL0581: 1. 


>: 1,AR0< 
J: 5, S005C 

: 3, H03.81: 
• 2 S0031' 

UH0255: 
: 1, S0045: 

1,H0013: 

1.L0105: 
: 1, H0687 
: 1, S0142; 

1.S0152: 

1. 


: 2,AR0< 
5: 5, S005C 

3.H0381: 
•2 S0031 1 

1.H0255: 
: 1, S0045: 

1,H0013: 

1.L0105: 
: 1, H0687 
: I, S0142: 

1.S0152: 

1. 


T«o2^2ooOOoo 






Gly-24 to Lys-36. 




Ala-6toTyr-ll, 
Gly-21 to Lys-33, 
Pro-54 to Trp-61, 
Ala-69 to He-75. 




Asn-42 to Gly-47, 
Lys-55 to Ala-62. 
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His-40 to Asn-46, 
Ser-101 to Lys-107, 
Ile-179 to Arg-184, 
Trp-223 to Cys-230, 
Phe-300 to Phe-306, 
Lys-353 to Gly-360, 
Leu-477 to Arg-490. 




Gly-21 toPro-27, 
Gln-62 to Asp-67, 
Asn-117to Leu-124, 
Arg-131 to Phe-138. 
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Gly-21 to Pro-27, 
Gln-62 to Asp-67, 
Asn-117to Leu-124. 
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Glu-87 to Thr-94. 


Ser-13 to Leu-22, 
Phe-40 to Lys-45. 


Ser-13 to Leu-22, 
Phe-40 to Lys-45. 


Arg-1 toLys-9. 


Gln-59 to Ser-71. 


Ser-8 to Gln-14, . 
Asp-52 to Pro-63, 
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Asp-37 to fle-44, 
Asp-47 to Thr-52, 
Pro-80 to Asp-85, 
Ala-90 to Tyr-lOl, 
Asp-138 to Glu-146, 
Ser-154toPhe-161, 
Asn-172 to Gln-178, 
Gln-185toGlu-190, 
Asn-205 to Ser-215. 


Phe-8 to Asp- 18, 
Pro-71 to Arg-84, 
Arg-90 to Asp-97, 
Ser-125toLeu-133, 
Ala-137toGln-144, 
Met-181toGly-190, 
Gln-193 to De-199. 




Glu-33 to Thr-45, 
Arg-50 to Ser-59. 




Val-2 to Val-12, 
Asp-20 to Glu-26, 
Gln-56toGly-61, 
Gly-69 to Arg-76, 
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Val-41 to Asp-46, 
Met-64 to Arg-70, 
Arg-135 to Lys-146, 
Tyr-151 to Asn-157, 
Glu-167 to Ser-172. 


Val-19 to His-24, 
Gly-88 to Gly-93, 
Pro-156 to Arg-169. 


Leu-13 toTyr-18, . 
Tyr-108 toGly-113. 
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Arg-1 to Gly-8, 
Pro-18 to Gly-25, 
Thr-60 to Leu-67, 
Gly-107toThr-113. 


Ser-lOto Ser-16, 
Lys-226toTrp-231, 
Thr-288 to Ser-300. 


Ser-lOto Ser-16, 
Phe-89 to Ser-97. 
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Gly-33 to Asp-45, 
Ser-78 to Gly-85. 




Arg-98 to Thr-104, 
Gln-117toLys-122, 
Tyr-250 to Leu-262, 
Leu-294 to Phe-304, 
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Arg-59 to Thr-65, 
Gln-78 to Lys-83, 
Tyr-211 to Leu-223, 
Glu-257 to Lys-262. 


Arg-1 to Cys-9, 
Tyr-47 to Leu-59, 
Leu-91 toPhe-101, 
Gly-156 to Lys-164, 
Arg-179toPhe-190, 
Pro-369 to Ser-389. 
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Pro-39 to Gln-44, 
Pro-94 to Asp-101. 


His-6 to Asn-11. 


His-6 to Asn-11, 
Asp-74 to Ala-83, 
Asp-95 to Leu-101, 
Leu-108toSer-113. 
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Ala-770toGly-777, 
Trp-803 to Leu-808, 
Leu-833 to Trp-843, 
Ala-851 toThr-856, 
Gln-863 to Trp-873, 
Cys-883 toArg-901. 


Arg-8toLys-19, 
Gln-75 to Pro-84, 
Ser-112toSer-120. 




Glu-65 to Arg-72. 


Ser-36 to Lys-42, 
Ala-70 to Gln-86. 


Glu-28 to Phe-33, 
His-47 to Ser-53. 
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Met-1 to Glu-14, 
Thr-73 to Glu-81, 
Ala-86 to Ile-96. 


Glu-31 to Leu-36. 






Asp-l to Gly-9, 
Asp-86 to Glu-91, 
Pro-97 to Gly-103, 
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Gly-ltoThr-10, 
Ala-14toGly-19, 
Pro-52toVal-57, 
Pro-85 to Gln-95, 
Lys-198toHis-204, 
Pro-254 to Glu-260, 
Glu-269 to Ser-282, 
Glu-302 to Gly-307, 
Asp-320toAsp-326, 
Asp-373 to Ser-380, 
De-396 to Asp-407. 
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Gly-8 to Gly-13, 
Ala-76 to Ala-81, 
Arg-154'toGly-159, 
Arg-338 to Pro-349. 
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Gly-17toThr-26, 
Glu-93 to Asp-101, 
Arg-117 to Ala-125. 


Gln-10 to Thr-18, 
Ser-40 to Lys-47, 
Lys-59 toLys-64, 
Lys-73 to Leu-82, 
Asp-145 to Thr-160. 




Ala-27 to Ala-36, 
Glu-41 to Asp-48, 
Asp-84 to Lys-92, 
Ala-140 to Glu-145, 
Leu-168 to Glu-173, 
Gln-213 to Ser-218. 


Ala-27 to Ala-36, 
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Asp-5 to Arg-13, 
Thr-37 to Ser-45, 
Ser-131 toPro-137, 
Glu-154 toHis-160, 
Lys-162 to Arg-168, 
He-180to Asn-185. 


| Asp-5 to Arg-13, 
Thr-37 to Ser-45. 
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Asp-7 to Glu-28, 
Ser-42 to Asp-69, 
Gln-79 to Asp-102, 
Leu-105 to Cys-1 12, 
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Lys-457 to Pro-470, 
Gly-478 to Gln-483, 
Phe-519toCys-533. 


Lys-1 to Gly-14, 
Gly-23 to Met-43, 
Ala-87 to Pro-99, 
He-101 to Ile-121, 
Gln-126 to Val-135, 
Val-139 toCys-147. 
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Arg-86 to Gln-101, 
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Ser-77 to Leu-90. 

■ 




1167 


1707 


1168 


1708 


1169 




o 

■<* 

■ 


1 - 1551 


1 - 609 


vo 

r-H 

vo 
» 

On 

T-H 


i-H 

CO 
1 

CO 






© 

oo 


oo 
vo 

CN 


oo 
o 
oo 


ON 
VO 

CN 




997643 


883028 


1151518 


914561 


1164014 




HSIJI46 


HTFOK70 


HUSX071 




Of 


oo 
<n 

CN 


ON 

to 

CN 



174 



WO 01/55301 



PCT7US01/01239 



oo 
o 



vo vo 



^ i— » ^ co cn 
^ oo ^ \r> t> r< 
^ m U ^ 



33HSS-Sa 



I i-H *— ' *^ 1-4 r-H CN 

VO ON f» 22 CN ON O 

o o ^ ^ m ffi 
o o Q £=2 o o 

3 3 « w 3 3 § 



CN CN 



H h ^ o tn 

O ON OO ON i— i 

k £ m vo o 
o o o o o o o 

WWB WW 



^ on os 22 o t-H o 
<o oo nt io m o> 
vo t> vo *r\ 

O O O O o O Q 

>-lhlhlWt-lhl^ 



cn 

CN 
^ oo vo" 

VO Tfr 

Q |> ro 
o o 

t-\ oo 

VO H-l 
CN 



vo" cn cn ^ CS CN CN CN of 



r ^ VO 

^ m vo 



CN 



O ON ON 

^ S ^ ^ ^ , i 



CN VO 

O O h 

2 £ '*> 

o o o 

CO ^ 



ON 
CO 

s 



t^vo^coco^cNcN^^cN 

000 0 Q00 0 00p 



vo 



£6 
o o 

t-i ON 



in 

CO 



CO fo 

in 



o p o 



CN 
t 

6tA 



oo 



00. ;3 

33 



CN 00 ^ 

5 o o 
co 

o co vo 

■ t-H l — C 

I 1 I 

Ph co 00 

<c 6 < 



I-H CO 

^ <? 

00 co 

O O 
-t-» 

o <n 

O CN 

CN CN 



co o 

vo i=5 
co 

PL, £ 

co £ 

<! h4 
o o 

m co 
in o 
CN co 



o S 00 

o ^ o 
2 ^ 

^ T 

CO g 



oo 



ON 
O 

r> 



oo 
vo 



ON 
O 
OO 



ON 

vo 

CN 
VO 
oo 



o 



ON 
CO 
CN 
CN 
t 

CN 



O 
CN 



oo 

ON 

CO 
CN 
CN 



o 
vo 

CN 



175 



WO 01/55301 



PCT/US01/01239 



3 <N 

^ o 
^ r- o 
Q o o 



10 



o vo vo vo «o 
S o> ^ — 
in i5 \o m ^ vo 



5 o S 

w ^ ^ ^ ^oOgoOOOooQOooSSSooogooSSgo 



ON 
VO 



o<sOiooo^^voVOC^cMcocoooonvoooo 



2 £J 
cn o 



S: <s oo a 

^ r3 

S S vo 

UM 

2 o o o 

VO ^ CO OO 
vo ^ ^ 
v> vo VO vo 



<N 

i i 

0 Oh 

o c- 

p ro 

Si 

<3 



176 



WO 01/55301 



PCT/US01/01239 



<N 

S 



« <N 
CO 

o 

1 



OS 

o 
*o 
o 

Si 

25 oo 



on 

o o 
on a 



^ ^ 



<2> o 



o 
o 



• — ■ 

in 



1 — * 




oi 




t> 




vo 




o 




a 


T— < 




00 






oo 


o 


co 


o 


o U 


OH 


and 



VO <N . 
O ^ OO 



rfr ^ CO~ oT c^f CN CN <N , " H ^ 



" vo « 

g CO CO 

o o 
^ O oo 

^3 



^ On 
PP vo OS 

o o 10 



r-i 

r> in on 
pop 

h4 h-J 



oo r< t-i fo 
vo ^ ^- 

vo ^ CO >5 
o o o 2 



COCOc^cNofc^CN^ 



i 



O ^ oo 
Os vo ^ 

O O ^ 

Ed ffi oo 



<N £2 ^ rn 

VO ^ N 

o 2 vo 2 



O VO h 
h VO ON 

cs ^ , _ _ 
p o p p Q vo o 



^5 CO co 
cn in 

i i 

0 o o 

+-» +^ ^ 

on oo 

t-H "3" <r> 

1 i i 

o < o 



I f-H 

CO 

o £ 
2 o 

■ - » w <— — §j CO 

5 cm 5 <; a 



vo VO m 

« fNl CO ,1 

^ u o 

o ^ ^ 
^ OO ON O 

rH H (S 
J I > » 

O Pl 5 



On 

cn 

CO 



CN 
* CO 

CO T-H 

^ o 
o 

+* m 

OO CN 
CO i— • 

i— c t— i n 

© o 



o 

t-H 



CO 
I 

CN 



O 
OO 



CO 

o 
o 

r— I 

00 



CN 



OS 

»n 

o 
oo 



CN 
CN 



in 

Os 



CN 



o 

Os 
CO 
I 

CN 



CN 

T— 4 

oo 



vo 

1— I 
CO 
CO 
OS 



CO 



CN 
CN 
CO 



vo 



CO 

CN 



CN 
Os 
oo 

OS 



vo 

CN 



CN 

vo 

CN 



CO 

vo 
CN 



177 



WO 01/55301 



PCT/US01/01239 











; 1 and 




i; 58.AR054: 
.051: 41,AR089: 
i61: 0 

4: 2,H0265: 1, 
1, S0358: 1, 
: 1, T0039: 1, 
: 1, S0010: 1, 
: 1,H0263: 1, 
: 1,H0416: 1, 
: 1,L0796: 1, 




in ON <T\ CO ro t—i to Os rTk rv"i vn i— 4 vn ^t* CO 

^So^SoS^^^^^ vo ^ (N «n m 2; ^ 
oooOO°88ooooo HrH ooOO 


.Woggggg 
< kt> r-i ooCdSBtCBdrc 






Gly-92 to Pro-97, 
Cys-107 toGln-131, 
Pro-139toAla-147, 
Pro-149 to Arg-160, 
Thr-194 to Pro-206. 
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Glu-226 to Glu-232, 
Leu-257 to Ser-262. 


Gly-1 to Gly-6. 


Pro-tltoSer-20, 
Ala-35 to Pro-41, 
Gln-88 to Trp-95, 
Arg-llltoAsp-119. 


Val-20 to Gln-36, 
Arg-67 to Glu-78, 
Pro-154toPhe-159. 


Asn-1 to Gln-9, 
Arg-40 to Glu-51. 


Arg-6toGln-13, 
Thr-44 to Ser-50, 
Pro-145 to Asn-168. 
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Tyr-14 to Cys-23, 
Arg-41 to Lys-46, 
Ser-53 to Asp-74, 
Glu-106toGln-116, 
Ser-129 to Leu-135. 




Gly-40toVal-46, 
His-66 to Ser-72, 
Trp-83 to Gly-88, 
Trp-143 to Gly-149. 


Gly-38 to Val-44. 


Ala-76 to Gly-82, 
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Glu-126toAla-132. 








Glu-40 to Trp-57, 
Tyr-59 to Phe-64, 
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Asp-106 to Arg-114. 


Val-22 to Asp-27, 
Gly-37 to Gln-42, 
Thr-48 to Glu-54, 
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Glu-25 to Trp-33, 
Trp-76 to Gln-83, 
Pro-94 to Asp-108. 


Glu-25 to Trp-33, 
Trp-76 to Gln-83, 
Pro-94 to Asp-108. 


Glu-4 to Ser-9, 
Ser-58 to Arg-65. 
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Phe-62 to Arg-67, 
Gln-92 to Leu-104. 




Ser-50 to Ser-66. 


Glu-38 to His-43, 
Arg-58 to Thr-68. 
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Asp-l to Arg-8, 
Lys-15 to Asn-20, 
Thr-74 to Leu-80, 
Pro-84 to Asp-90. 


Thr-7 to Leu-13, 
Pro-17 to Asp-23, 
Ala-180 to Arg-188. 


Thr-7 to Leu-13, 
Pro-17 to Asp-23, 
Ala-180 to Arg-188. 


Tyr-5 to Thr-14, 
His-61 to Asn-70. 


Tyr-6 to Thr-13, 
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His-60 to Asn-69. 


Tyr-101 to Glu-108, 
Pro-110toArg-116, 
Tyr-158 to Gln-164. 


Tyr-101 to Glu-108, 
Pro-110toArg-116, 
Tyr-158 to Gln-164. 


His-8 to Gly-18. 


His-8 to Gly-18. 






Leu-59 to Gln-64. 




Lys-1 to Ile-6, 
Pro-28 to Glu-37, 
Leu-58 to Arg-65, 
Pro-95 to Glu-102, 
Arg-104 toGly-111, 
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Ala-46 to Pro-68, 
Gln-75 to Gly-84, 
Leu-106toGly-121, 
Pro-208 to Lys-214. 


Phe-3 to Cys-8, 
Ser-64 to Gln-69. 




Phe-3 to Cys-8, 
Ser-64 to Gln-69. 


His-27 to Thr-32. 


Glu-140 to Trp-147, 
Asn-323 to Glu-329. 
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Gly-22toPhe-27, 
Tyr-36 to Ala-48, 
Glu-51 to Pro-79, 
Pro-102 to His-113. 
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Lys-50 to Gly-56, 
Pro-114toGly-122, 
Glu-129toTyr-134, 
Ala-174 to Leu-179, 
Arg-210 to Tyr-222. 




1243 




3-686 




ro 




927125 




HMCBN45 




CO 
CO 
CO 



210 



WO 01/55301 



PCT/US01/01239 



O of 

H ., 

vo m oo 
Ohm 

^ \n f-H 



£ 

5> vo 

i> vo 

o o o 

W iJ hJ 



^ Ch C\ 

*h r- ^ 
t — c — - 
o o o 

►J lj 1^ 



CN CN O 

*r> vo i-H 

r-- co cn 

q o o 



t o6 

5 ^ ^ 
^ m 

2 o q 
B co J 



N m oo 
5 O «T) ^ 

S o o 2 



3 ^ s 

CM VO 2 

o o ^ 



w w § 3 2 



*0 r-H 

«n On 

vo 

o o 
^ k4 



66 



vo ^ p 
»o >n I s 
r> 

o o o 
^ -I K 



i-Hoor^ir>^*o^^^cocococococsfc>f^^^cN~c^^ 



ON 
OO 



vo - 
vo O 

o ^ ^; 

r> o 

^^^^^^^^^ 



m h h m oo ^ 
o co vo o ^J- On 
oo o oo 



o r-i vo . vo m 

in vr> i> o cn ^ 

h h oo h h 

q q q q o q 



ovvo^jvoo^oiooo^r^ 
^t^r-*>$Q£!^ovor-o 
t> co ^f* ^ S >P -r-i oo vo 
qoogogoqqqq 



co ^ 
° 2 



vo co vo 
>n (N 
o 

q q o 

00 



oo 

I 

oo 

CO 

I 



o 

00 

H 

C\ CN 
00 

A i 



3 ^ 
2 B 

co 

co in 

<; oo 



?z co 



CN 

< o 



o 



vo 
vo 



CN CN 
O 



5 

CN 



OO 
CN 



CN 

in 
co 



5 



ON 

co 
<N 
CM 



co 

CO 



211 



WO 01/55301 



PCT/US01/01239 



ooo§o§ogoSoooo§ogg§o§SSoo§ooSg§ 

00 0 0 000 0 0 0 0000000 00 0 OOo 00 OOOoOO 



212 



WO 01/55301 



PCT/US01/01239 



O O O O O ON <N 
OOOOfOHMTt 

vo vo oo o <n o\«o 
!> on oo o t-< 
fO OO h ^ o o 
i-h <M csj VQ VO 



oooooooo2°22°Po° 
^^^^hJ^^ffi ^^^^^^ ™ ^ 

OOOOOOOOOOOOOoogo 



° ^ 

o g 
CO 



1 



<N 
I 



vo 



ON § 
O o 



1 



a, 

5v 



o 

o 2 



CO 

00 

o 

p 



I 



to 



vo 
to 



00 

VO 



to 



CO 
CO 

r- 
to 



o 

£ 2 

00 vj 

.S3 
0 s 



vo 



CO 

as 

I 



vo 

00 



00 
vo 

1 — c 

o 
10 

0\ 



vo 



CN 



CO 



vo 



10 
o\ 

00 
10 
«o 



»o 

m 

(O 



vo 

CO 
CO 



213 



WO 01/55301 



PCT/US01/01239 











s 




7: 4, L076< 
3, L0439: 
: 2, L0809: 

1, H0656: 
: 1,H0051 
1, H0272: 
1, L0805: 
1,L0519: 
: 1,L0779: 
1, L0584: 
1. 


^^ooooooSSooooooOo 

^ H .H|WHlHlHlWWWwHlHlHlHlHlHlhl 




Rh^oooooooQooooooOo 


Glu-34 to Ser-39, 
His-59 to Asn-64. 


His-8 to Arg-13, 
Ser-23 to Lys-30. 
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His-1 to Thr-6, 
Pro- 14 to Trp-21, 
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Ile-199 to Gly-204. 
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Pro-56 to Arg-67, 
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Lys-35 to Gln-40, 
Gln-61 to Lys-66, 
Ser-116toGly-121, 
Gln-192 to Ser-205. 
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Asp-125 to Gln-135. 


Pro-8 to Gly-17, 
Arg-22 to Ser-29. 






1355 


1356 


1357 




93-533 


3 - 500 


68-316 




Uo 




t~- 




880328 


882939 


839516 




HGBCU40 


HE9PR39 


HTEAF36 




5 


5 


5 



261 



WO 01/55301 



PCT/US01/01239 



* ~ ~ 

^ r-» o 



oo . . 

10 ir> 

ro *n 

O O 

00 o 

VO * 



00 00 o 

^ ^ M 10 On 
^ S ^ 2 ^ 



CO 



co co co co co 



CO 



co ci CN 

°° vo £ S 
£ vo vo 
O O O 

»-< »-h i—» 00" 00" ocf ^ ^ vcT vo ^ in in *f **t xf ro rn rn co" cn of of 



i—i VO t"""* 

O CO 

vo CO vo 

o o q 
^ CO 



vo ^ 

Co ^ ^ 

o o o 

h-l W H 



o vo 
00 <o 

CO CO 

o o 
00 00 



vo 

00 o 
m vo 
o o 

a a 



(N ts m 
00 
vo vo 
o o o 

^ 3 a 



00 r- 

22 s S 

„ ^ _ . s s I 

go a a a a a >_3 



CO O 

*n cs co 

CN VO O 
OOO 



vo »o vo «o CO 

VO t> CO vo 

O C> O CI <z> 

^ ^ 00 kJ| ,_| 



^J- CO 00 

io VO i-H 

vo vo 

o o o 

3 a a 



vo VO 
^- vo ^ 
VO vo ^ 



ooooooooooo^o^oo 

a^aaa^SSaaaaaSaSaa 



262 



WO 01/55301 



PCT7US0 1/01239 



<n <n cs <n <s cs cs rsf ^ of of of 7! « '<s °* 71 J ^ 71 71 ^ 7! 7! 7! 71 ^ 7* 71 

of of °^ ^ ^ ^ ^ °^ of CS of ^ ^ of ci of of i-T 1-* *-* *~* i-T 

00 000000 OOOOOo°OOOOoOOOoOOOOoOO 



263 



WO 01/55301 



PCTAJS01/01239 



- j „- „• j ~- „■ j „- 3 

S 0 S2 0 oooooooooooo2°222o°ooooooo 



m o >n 



222p2ooooooqooooogoOoo 0 o 0 oooooo 



VO AO O 



264 



WO 01/55301 



PCT7US01/01239 



<N 
CO 

cr 



(N 



a\ cn 

00 oo o 
o *— i o 

of 



i 



o 
o 
vo 
o 

j=1 



311 
3 & 



VO 



cocoCScNc>fcN^«^t-H>-< , " H, "~''— • 



3 «> S ^ N 

£j O t> ^ u-> 
o o o o o o 



VO CN CN VO ^> O CO 

^ ^ ^ 23 S S 2 

CO ^ CN O O VO 0 

° ° ° 2 S S 2 

oo oo co fxj W ffl ffi 



co vo vo o o Os 

h vo m ^ S 

o o o o o o 

gCj S gg ffi i— j h^I 



»n o o> H ^ o 

N'nvoo too 
O O O o o O o 

w w w h a b t- 



oo~ ,-T 

^ VO ON 
$> CJ w 

fi < a 

2 2 2 

h h m 

CN VO CO 
i i i 

333 



O co 



3«£ 

~ ^ m 

ON t-. 1* 



OS VO CO 

■S £ £ 

o 



^ to 



O O 

o 

^ oo oo 
H m h 

I r I 
3 00 0 

o < o 



O 



ON 



^ -H "H 



o 



1 — I 



in On 
co <n 

r-» H 
I I 

oo a 



oo O 

0 o 

■+-» +-» 

CN CN 

• CN 

1 i 

P > 



« CO CN 

co V ^ 
l-l o o 

o -r ~ 

-«-> 0\ <-H 

■<* CS CN 

w V «? 
o £ 



vo o\ t> 

co ^ 

CN CN 

-2-2 3 

t-h oo o 

co oo CN 

CN CN CO 

2 



^ "f 

2 £ 
pm O 

2 2 

OO O 
ON 



CN 



CO 

«n 

CO 



ON 

*n 

co 



ON 
VO 
VO 



»n 
OO 

o 

i— I 

I 

CO 



vo 
o 

ON 



CN 
VO 
CO 



CN 

oo 



oo 



ON 

m 



in 
On 

< 

CO 

oo 
oo 



oo 

CO 

oo 
oo 



oo 

00 
00 
00 



vo 
vo 

I 



oo 

5 



0\ 

> 



ON 

3 



265 



WO 01/55301 



PCMJS01/01239 







>.1 


'4 
















1 




vo 

r—i 


T— ( 


: 1 and 




..VO ~ „ ^ « . « • . * ^ „ 

f-H ..^^-rococN^cSoJOJ 


: 1, H0271 
: 1, S0250: 
: 1, H0634 
• 1 S0386- 
1.H0641: 


1, L0766: 


1,L0565: 
: 1, S0152: 
: 1, L0754: 

1, L0605: 
1.H0542: 
: 1, H0422 




o 

%s 

-co 

<M CM 


1, S0360: 


: 1, H0040 


i—i 


S S3 £ * 22 £ oo s ; ^> «> 
..^t^co^co^^cscn 


S^^^Ococ^JQ^^vovo^^J 


00 *f <» £ 

o £ 


H0522 


gg to 3; vb o o j¥ oo vo 




Ser-14 to Trp-22. 


Arg-147 to Asn-153, 
Arg-165 to Glu-174, 
Phe-217 to Lys-222, 
Ala-306toSer-313. 
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Gly-11 toLys-20, 
His-41 to Cys-47, 
Thr-82 to Lys-90. 
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Ser-2 to Gln-22, 
Pro-94toHis-110, 
Phe-167toAla-172, 
Leu-261 to Gly-268. 
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Arg-ll to Arg-16, 
Gln-34 to Arg-40, 
Ser-119toGln-126, 
Lys-147toGly-157. 
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Gln-21 to Trp-32, 
Lys-81 to Leu-86, 
Pro-100 to Cys-107. 
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Asp-41 to Cys-50, 
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Asn-89 to Glu-96, 
Glu-113toGln-119. 
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Pro-133 to Lys-142, 
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Val-216toTyr-221. 
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Gly-8toPhe-18, 
His-26 to Phe-41, 
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Thr-67 to Arg-72, 
Val-87 to Ser-98, 
Glu-170 to Gly-176, 
Lys-190 to Asp-200. 


Arg-6toGlu-12, 
Tyr-30 to Thr-35, 
Val-42 to His-52. 


Ala-1 to Ser-7, 
Gln-31 toLeu-46r 
Arg-49 to Glu-55, 
Tyr-73 to Asp-79. 


Ala-8 to Pro-23, 
Ala-25 to Pro-30, 
Arg-46 to Glu-53. 
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Asp-5 to Lys-13, 
Gly-107toCys-113, 
Thr-125toLeu-131, 
Lys-146 to Asp-155, 
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